A Comparative Assessment of Text-independent Automatic Speaker Identification Methods Using Limited Data

نویسندگان

چکیده

Automatic Speaker Identification (ASI) is one of the active fields research in signal processing. Various machine learning algorithms have been used for this purpose. With recent developments hardware technologies and data accumulation, Deep Learning (DL) methods become new state-of-the-art approach several classification identification tasks. In paper, we evaluate performance traditional such as Gaussian Mixture Model-Universal Background Model (GMM-UBM) DL-based techniques Factorized Time-Delay Neural Network (FTDNN) Convolutional Networks (CNN) text-independent closed-set automatic speaker on two datasets with different conditions. LibriSpeech experimental datasets, which consists clean audio signals from audiobooks, collected a large number speakers. The other dataset was prepared by us, has rather limited speech low signal-to-noise-ratio real-life conversations customers agents call center. duration query phase an important factor affecting performances ASI methods. work, CNN architecture proposed short segments. design aims at capturing temporal nature optimum convolutional neural network parameters compared to well-known architectures. We show that CNN-based algorithm performs better dataset, whereas amount data, method outperforms all DL approaches. achieved top-1 accuracy model 99.5% 1-second voice instances dataset.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text Independent Speaker Identification Using Automatic Acoustic Segmentation

This paper describes an acoustic class dependent technique for text independent speaker identification on very short utterances. The technique is based on maximum likelihood estimation of a Gaussian mixture model representation of speaker identity. Gaussian mixtures are noted for their robustness as a parametric model and their ability to form smooth estimates of rather arbitrary underlying den...

متن کامل

Text-Independent Speaker Identification

Speaker identification is a difficult task, and the task has several different approaches. The state of the art for speaker identification techniques include dynamic time warped(DTW) template matching, Hidden Markov Modeling(HMM), and codebook schemes based on vector quantization(VQ)[2]. In this project, the vector quantization approach will be used, due to ease of implementation and high accur...

متن کامل

Text-Independent Speaker Identification Using Temporal Patterns

In this work we present an approach for text-independent speaker recognition. As features we used Mel Frequency Cepstrum Coefficients (MFCCs) and Temporal Patterns (TRAPs). For each speaker we trained Gaussian Mixture Models (GMMs) with different numbers of densities. The used database was a 36 speakers database with very noisy close-talking recordings. For the training a Universal Background M...

متن کامل

Robust text-independent speaker identification using Gaussian mixture speaker models

This paper introduces and motivates the use of Gaussian mixture models (CMM) for robust text-independent speaker identification. The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are efTective for modeling speaker identity. The focus of this work is on applications which require high identification rates using short utterance ...

متن کامل

Limited data speaker identification

In this paper, the task of identifying the speaker using limited training and testing data is addressed. Speaker identification system is viewed as four stages namely, analysis, feature extraction, modelling and testing. The speaker identification performance depends on the techniques employed in these stages. As demonstrated by different experiments, in case of limited training and testing dat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Europan journal of science and technology

سال: 2021

ISSN: ['2148-2683']

DOI: https://doi.org/10.31590/ejosat.950218